\ 



1/102 



FIGURE 1A 
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FIGURE 7 



CELL LINE/TYPE 


11 p1 1.2-13 
REGION 


METASTATIC 


PSM RNA 
Ut I to I tu 


PSM DNA 
ut I tu I tu 


LNCap 






+ + 


ND 


HUMAN 
PROSTATE 






+ + 


ND 


A9 

(FIBROSARCOMA) 


NO 


NO 






A9(11) 

(A9+HUM. 11) 


YES 


NO 


— 


REPEAT 


AT6. 1 

(RAT PROSTATE) 


NO 


YES 






AT6.1-11-C11 


YES 


NO 


+ 


-I- + 


AT6.1-11-C12 


NO 


YES 


— 


— ■ 


R1564 

/DAT M A M M A DV\ 

(HAI MAMMAnY) 


NO 


YES 


— 


— 


R1564-11-C14 


YES 


YES 




+ 


R1564-11-C15 


YES 


YES 




REPEAT 


R1564-11-C16 


YES 


YES 




ND 


R1564-11-C12 


YES 


YES 


ND 


+ 



8/102 



FIGURE 8A 
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FIGURE 9A 
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FIGURE 14 

Patient Stage 
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FIGURE ISA 

10 20 30 40 50 60 

I I I I I 1 

1 GCGCCTTAAA AAAAAAAAAC TTTCTTGGAA AATGTCCAGC TCTTGCTTAA ATATAAAAAT 

CGCGGAATTT TTTTTTTTTG AAAGAACCTT TTACAGGTCG AGAACGAATT TATATTTTTA 



61 GAAAGGAAGA AAGAGACTCT CCTCTCTCCA CTCCTATAAT TATGAGGAAC TTTTATTCAA 
CTTTCCTTCT TTCTCTGAGA GGAGAGAGGT GAGGATATTA ATACTCCTTG AAAATAAGTT 



121 CTCTGAAATT CTATACAATC TCTACAATAC TCTACTGAAT AAAAG C AG AG CAGAAAAAGC 
GAGACTTTAA GATATGTTAG AGATGTTATG AGATGACTTA TTTTCGTCTC GTCTTTTTCG 



181 TGCGCTTTTT TTCCATAGTC GGGAATGCTT GTCATCAGTG TAAATCACCA CCGCGCCCTT 
ACGCGAAAAA AAGGTATCAG CCCTTACGAA CAGTAGTCAC ATTTAGTGGT GGCGCGGGAA 



241 TTTCCTAAAG AATATTATTG TTATTAATAA ACATGTAGGG TATTATCCTC CACTTACATT 
AAAGGATTTC TTATAATAAC AATAATTATT TGTACATCCC ATAATAGGAG GTGAATGTAA 



3 01 ACAAAACCAT TTTTTAAAGC CGGGCGTGGT GGCTCACGCC TGTAATCCCA GCACTTTGGG 
TGTTTTGGTA AAAAATTTCG GCCCGCACCA CCGAGTGCGG ACATTAGGGT CGTGAAACCC 



3 61 AGGCCCAGAC AGGCGGATCA CGAAGTCGAG AAATCGAGAC CATCCTGGCC AACATGGTGA 
TCCGGGTCTG TCCGCCTAGT GCTTCAGCTC TTTAGCTCTG GTAGGACCGG TTGTACCACT 



4 21 AACCCCATCT CTACTAAAAA TACAAAAATT AGCTGGGCGT GGTGGCGGGC TCCTGTAGTC 
TTGGGGTAGA GATGATTTTT ATG TTTTTAA TCGACCCGCA CCACCGCCCG AGGACATCAG 



4 81 CCAGCTACTC AGGAGG CTGA GGCAGGAGAA TCGCTTGAAC CGGGGAGGCG GAGGTTGCAG 
GGTCGATGAG TCCTCCGACT CCGTCCTCTT AGCGAACTTG GCCCCTCCGC CTCCAACGTC 



541 TCAGCCAAGA TAGCGCCACT GCACTGGAGC CTGGTGACAG AGTGAGACTC CCTCAAGAAA 
AGTCGGTTCT ATCGCGGTGA CGTGACCTCG GACCACTGTC TCACTCTGAG GGAGTTCTTT 



601 GAAAGGAAGG GAAGGGAAAG GGAAGGAAGG GGAGGGGAAG GGAGGGGAGG GGAGGGGAGG 
CTTTCCTTCC CTTCCCTTTC CCTTCCTTCC CCTCCCCTTC CCTCCCCTCC CCTCCCCTCC 



661 AAAGAAAAGA ATACTGGAAC TTGTTGAAGG CAGAGACTTT ATTTTCATAT CCCGGCTATG 
TTTCTTTTCT TATGACCTTG AACAACTTCC GTCTCTGAAA TAAAAGTATA GGGCCGATAC 



721 TCTGGCTACT GTCTTACGTA ATAGATATAA AATCAATCTT GGTTGGATTA ACCAGAAGAA 
AGACCGATGA CAG AATG CAT TATCTATATT TTAGTTAGAA CCAACCTAAT TGGTCTTCTT 
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FIGURE 15B 



781 TCAGAAGATA TATTCTGGTA AGTTGAATAC TTAGCACCCA GGGGTAATCA GCTTGGACAG 
ACTCTTCTAT ATAAGACCAT TCAACTTATG AATCGTGGGT CCCCATTAGT CGAACCTCTC 



841 GACCAGGTCC AAAGACTGTT AAGAGTCTTC TGACTCCAAA CTCAGTGCTC CCTCCAGTGC 
CTGGTCCAGG TTTCTGACAA TTCTCAGAAG ACTGAGGTTT GAGTCACGAG GGAGGTCACG 



901 CACAAGCAAA CTCCATAAAG GTATCCTGTG CTGAATAGAG ACTGTAGAGT GGTACAAAGT 
GTGTTCGTTT GAGGTATTTC CATAGGACAC GACTTATCTC TGACATCTCA CCATCTTTCA 



9 61 AACACAGACA TTATATTAAG TCTTAGCTTT GTGACTTCGA ATGACTTACC TAATCTAGCT 
TTCTGTCTGT AATATAATTC AGAATCGAAA CACTGAAGCT TACTGAATGG ATTACATCGA 



1021 AAATTTCAGT TTTACCATGT GTAAATCAGG AAGAGTAATA GAACAAACCT TGAAGGGTCC 
TTTAAACTCA AAATGGTACA CATTTAGTCC TTCTCATTAT CTTGTTTGGA ACTTCCCAGG 



1081 CAATGGTGAT TAAATGAGGT GATGTACATA ACATGCATCA CTCATAATAA GTGCTCTTTA 
GTTACCACTA ATTTACTCCA CTACATGTAT TGTACGTAGT GAGTATTATT CACGAGAAAT 



1141 AATATTAGTC ACT ATT ATT A GCCATCTCTG ATTAGATTTG ACAATAGGAA C ATT AG G AAA 
TTATAATCAG TGATAATAAT CGGTAGAGAC TAATCTAAAC TGTTATCCTT GTAATCCTTT 



1201 GATATAGTAC ATTCAGGATT TTGTTAGAAA GAGATGAAGA AATTCCCTTC CTTCCTGCCC 
CTATATCATG TAAGTCCTAA AACAATCTTT CTCTACTTCT TTAAGCGAAG GAAGGACGGG 



12 61 TAGGTCATCT AGGAGTTGTC ATGGTTCATT GTTGACAAAT TAATTTTCCC AAATTTTTCA 
ATCCAGTAGA TCCTCAACAG TACCAAGTAA CAACTGTTTA ATTAAAACCG TTTAAAAAGT 



1321 CTTTGCTCAG AAAGTCTACA TCGAAGCACC CAAGACTGTA CAATCTAGTC CATCTTTTTC 
GAAACGAGTC TTTCAGATGT AGCTTCGTG3 GTTCTCACAT G TT AG AT C AG GTAGAAAAAG 



1381 CACTTAACTC ATACTGTGCT CTCCCTTTCT CAAAGCAAAC TGTTTGCTAT TCCTTGAATA 
GTGAATTGAG TATGACACGA GAGGGAAAGA GTTTCGTXTG ACAAACGATA AGGAACTTAT 



14 41 CACTCTGAGT TTTCTGCCTT TGCCTACTCA GCTGGCCCAT GGCCCCTAAT GTTTCTTCTC 
GTGAGACTCA AAAGACOGAA ACGCATGAGT C3ACCGGGTA CCGGGGATTA CAAAGAAGAG 



1501 ATCTCCACTG GGTCAAATCC TACCTGTACC TTATGGTTCT GTTAAAAGCA GTGCTTCCAT 
TAGAGGTGAC CCAGTTTAGG ATGGACATGG AATACCAAGA CAATTTTCGT CACGAAGGTA 



1561 AAAGTACTCC TAGCAAATGC ACGGCCTCTC TCACGGATTA TAAGAACACA GTTTATTTTA 
' TTTCATGAGG ATCGTTTACG TGCCGGAGAG AGTGCCTAAT ATTCTTGTGT CAAATAAAAT 



1621 




1681 




1741 




1801 




1861 




1 7/1 02 

FIGURE 15C 1 /AU 

1921 GCAGTGGCGC TATCTTGGCT CACTGCAACC TCCGCCTCCC CGGTTCAAGC GATTCTCCTG 
CGTCACCGCG ATAGAACCGA CTGACGTTGG AOGCGGAGCG GCCAAGTTCG CTAAGAGGAC 

1981 CCTCAOCCTC CTGAOTAOCT GGGACTACAG GAGCCCGCCA CCACGCCCAG CT^TTTTTG 
GGAGTCGGAG GACTCATCCA CCCTGATGTC CTCGGGCGGT GOTGCGGGTC GATTAAAAAC 

2041 TATTTTTAGT AGAGATGGGG TTTCACCATG TTGGCCAGGA TGGTCTCGAT ^CTCGACTT 
ATAAAAATCA TCTCTACCCC AAAGTGGTAC AACCGGTCCT ACCAGAGCTA AAGAOCTGAA 

2101 CCTGATCCGC CTGTCTGGGC CTCCCAAACT GCTGGGATTA CAOGCGTGAG CCACCACGCC 
GCACTAGGCG GACAGACCCG GAGGGTTTCA CGACCCTAAT GTCCGCACTC GGTG GTGCGG 

2161 CGGCTTTAAA AAATGGTTTT GTAATGTAAG TGGAGGATAA TACCCTACAT GTTTATOAAT 
GCCOAAATTT TTTACCAAAA CATTACATTC ACCTC CTATT ATGGGATGTA CAAATAATTA 

2 221 AACAATAATA TTCTTTAGGA AAAAGGGCGC GGTGGTGATT TACACTGATG ACJAGCATTC 
TTGTTATTAT AAGAAATCCT TTTTCCCGCG CCACCACTAA ATG TG ACT AC TGTTCGTAAG 

2281 CCGACTATGG AAAAAAAGCG CAGCTTTTTC TGCTCTGCTT TTATT CAGTA 3AGTATOGTA 
GGCTGATACC TTTTTTTCGC GTCGAAAAAG ACGAGACGAA AATAAGTCAT CTCATAACAT 

2341 GAGATTGTAT AGAATTTCAG AGTTGAATAA AAGTTCCTCA TAATTATAGG AGTGGAGAGA 
CTCTAACATA TCTTAAAGTC TCAACTTATT TTCAAGGAGT ATTAATATCC TCACCTCTCT 

2401 GGAGAGTCTC TTTCTTCCTT TCATTTTTAT ATTTAAGCAA GAGCTGGACA TTTTCCAAGA 
CCTCTCAGAG AAAGAAGGAA AGTAAAAATA TAAATTCOTT CTCGACCTGT AAAAGGTTCT 

2 4 61 AAGTTTTTTT TTTTTAAGGC GCCTCTCAAA AGGGGCCGGA TTTCCTTCTC CTGGAGGCAG 
TTCAAAAAAA AAAAATTCCG CGGAGAGTTT TCCCCCGCCT AAAGGAAGAG GACCTCCGTC 

2521 ATGTTGCCTC TCTCTCTCGC TCGGATTGGT TCAGTGCACT CTACAAACAC TGCTOTGGTG 
TACAACGGAG AG A GAG AG CG AGCCTAACCA AGTCACGTGA GATCTTTQTO ACGACACCAC 

2581 GAGAAACTGG ACCCCAGCTC TGGAGCGAAT TCCAGCCTGC AGGGCTGATA AGCGAGGCAT 
CTCTTTGACC TGGGGTCCAG ACCTCGCTTA AGGTCGGACG TCCCGACTAT TCGCTCCOTA 

2641 TAGTGAGATT GAOAGAGACT TTACCCCGCC GTGGTGGTTG GAGGGCGCGC AGTAGAGCAG 
ATCACTCTAA CTCTCTCTGA AATGGGGCGG CACCACCAAC CTCCCGCGCG TCATCTCOTC 

2701 CACCACAGGC CCGGGTCCCG GGAGGCCGGC TCTGCTCGCG CCGAGATGTG GAATCTCCTT 
GTCGTGTCCQ CGCCCAOGGC CCTCCGGCCG AGACGAOCOC GGCTCTACAC CTTAGAGGAA 

2761 CACOAAACCG ACTCOGCTGT GGCCACCGCG CGCCGCCCGC^ GCTGGCTGTG CGCTGGGGCC 
GTCCrTTGGC TOAGCCOACA CCGGTGGCGC OCGGCGGGCG* CCACCGACAC GCGACCCCGC 

2821 CTGGTGCTGG CGGOTGOCTT CTTTCTCCTC CGCTTCCTCT TCGGTAGGGG GGCGCCTCGC 
GACCACGACC GCCCACCGAA OAAAGAGGAG CCGAAGGAGA AGCCATCCCC CCGCGGAGCG 

2881 OGAGCAAACC TCGGAGTCTT CCCCGTGGTG CCGCGGTGCT GGGACTCGCG GGTCAGCTGC 
CCTCGTTTGG AGCCTCAGAA GGGGCACCAC GGCCCCACGA CCCTGAGCGC CCAOTCGACG 

2 941 CGAGTGGGAT CCTGTTG C T G GTCTTCCCCA GGGGCGGCGA TTAGGGTCGG GGTAATGTGG 

GCTCACCCTA GGACAACGAC CAGAAGGGCT CCCCGCCGCT AATCCCAGCC CCATTACACC 

3 001 GGTGAGCACC CCTCGAG 

CCACTCGTGG GGAGCTC 
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FIGURE 15D 



24 01 GGACACTCTC TTTCTTCCTT TCATTTTTAT ATTTAAGCAA GAGCTGGACA TTTTCCAvAGA 
CCTCTCAGAG AAAGAAGGAA AGTAAAAATA TAAATTCGTT CTCGACCTGT AAAAGGTTC7 

2461 AAG TTTTTTT TTTTTAAGGC GCCTCTCAAA AGGGGCCGGA TTTCCTTCTC CTGGAGGCAG 
TTCAAAAAAA AAAAATTCCG CGGAGAGTTT TCCCCGGCCT AAAGGAAGAG GACCTCCGTC 

•2521 ATGTTGCCTC TCTCTCTCGC TCGGATTGGT TCAGTGCACT CTAGAAACAC TGCTGTGGTG 
TACAACGGAG AGAGAGAGCG AGCCTAACCA ACTCACGTGA GATCTTTCTG ACGACACCAC 

2 5c 1 GAGAAACTGG ACCCCAGGTC TGGACCGAAT TCCAGCCTGC AGGGCTGATA AGCGAGGCAT 
CTCTTTGACC TGGGGTCCAG ACCTCGCTTA AGGTCGGACG TCCCGACTAT TCGCTCCGTA 



2 641 TAGTGACATT GAGAGAGACT TTACCCCGCC GTGGTGGTTG GAGGGCGCGC AGTAGAGCAC 

ATCACTCTAA CTCTCTCTGA AATGGGGCGG CACCACCAAC CTCCCGCGCG TCATCTCGTC 

2 7 01 CAGCACAGGC GCGGGTCCCG GGUGCCCGGC TCTGCTCGCG CCGAGATGTG GAATCTCCTT 

GTCGTGTCCG CGCCCAGGGC CCTCCGGCCG A3ACGAGCGC GGCTCTACAC CTTAGAGGAA 

27 61 CACGAAACCG ACTCGGCTGT GGCCACCGCG CGCCGCCCGC GCTGGCTGTC CGCTGGCGCG 

GTGCTTTGGC TGAGCCGACA CCGGTGGCCC GCGGCGGGCC CCACCGACAC GCGACCCCGC 

2 821 CTCGTCCTGG CGGGTCGCTT CTTTCTCCTC GGCTTCCTCT TCGGTAGGGG GCCGCCTCGC 

GACCACGACC GCCCACCGAA GAAAGAGGAG CCGAAGGAGA AGCCATCCCC CCGCGGAGCC 

28 81 GGAGCAAACC TCGGACTCTT CCCCGTGGTG CCGCGGTGCT GGCACTCGCG GGTCAGCTGC 

CCTCGTTTGG AGCCTCAGAA GGGGCACCAC GGCGCCACGA CCCTGAGCGC CCAGTCGACG 



2941 CGAGTGGGAT CCTCTTCCTG GTCTTCCCCA GGGGCGGCGA TTAGGGTCGG GGTAATGTGG 
GCTCACCCTA GGACAACGAC CAGAAGGGGT CCCCGCCGCT AATCCCAGCC CCATTACACC 



3 001 GGTGAGCACC CCTCGAG 
CCACTCGTGG GGAGCTC 
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Potential binding sites on the PSM promoter* 



FIGURE 16 

Site Seq "Location 

AP1 TKAGTCA -1145 

E2-RS ACCNNNNNNGGT-1940 

-1951 

GHF NNNTAAATNNN 



JVC repeat GGGNGGRR 



uteroglobi RYYWSGTG 



#nt matched 

7/7 

12/12 
12/12 

-580 11/11 

-753 11/11 

-1340 11/11 

-1882 11/11 

-1930 11/11 

-1979 11/11 

-2001 11/11 

-2334 11/11 

" 2374 11/11 

- 2591 11/11 
" 2620 11/11 

- 2686 11/11 

-1165 8/8 

-1175 8/8 

-1180 8/8 

-1185 8/8 

-1190 8/8 

10/10 

- 250 8/8 

- 921 8/8 

- 1104 8/8 



NFkB GGGRHTYYHC -961 



IFN AAWAANGAAAGGR590 13/13 Cell 41:509 (1985) 
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FIGURE 31 

Prostate Specific Promoter: 
Cytosine Deaminase Chimera 
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FIGURE 32A 



10 20 30 40 50 60 

1 AAGGGTGCTC CTTAGGCTGA ATGCTTGCAG ACAGGATGCT TGGTTACAGA TGGGCTGTGA 
TTCCCACGAG GAATCCGACT TACGAACGTC TGTCCTACGA ACCAATGTCT ACCCGACACT 



61 CTCGAGTGGA GTTTTATAAG GGTGCTCCTT AGGCTGAATG CTTGCAGACA GGATGCTTGG 
GAGCTCACCT CAAAATATTC CCACGAGGAA TCCGACTTAC GAACGTCTGT CCTACGAACC 



121 TTACAGATGG GCTGTGAGCT GGGTGCTTGT AAGAGGATGC TTGGGTGCTA AGTGAGCCAT 
AATGTCTACC CGACACTCGA CCCACGAACA TTCTCCTACG AACCCACGAT TCACTCGGTA 



181 TTGCAGTTGA CCCTATTCTT GGAACATTCA TTCCCCTCTA CCCCTGTTTC TGTTCCTGCC 
AACGTCAACT GGGATAAGAA CCTTGTAAGT AAGGGGAGAT GGGGACAAAG ACAAGGACGG 



241 AGCTAAGCCC ATTTTTCATT TTTCTTTTAA CTCCTTAGCG CTCCGCAAAA CTTAATCAAT 
TCGATTCGGG TAAAAAGTAA AAAGAAAATT GAGGAATCGC GAGGCGTTTT GAATTAGTTA 



301 TTCTTTAAAC CTCAGTTTTC TTATCTGTAA AAGGTAAATA ATAATACAGG GTGCAACAGA 
AAGAAATTTG GAGTCAAAAG AAT AG A C ATT TTCCATTTAT TATTATGTCC CACGTTGTCT 



361 AAAATCTAGT GTGGTTTACA TAATCACCTG TTAGAGATTT TAAATTATTT CAGGATAAGT 
TTTTAGATCA CACCAAATGT ATTAGTGGAC AATCTCTAAA ATTTAATAAA GTCCTATTCA 



421 CATGATAATT AAATGAAATA ATGCACATAA AG C A C AT AGT GTGGTGTCCT CCATATAGAA 
GTACTATTAA TTTACTTTAT TACGTGTATT TCGTGTATCA CACCACAGGA GGTATATCTT 



481 AATGCTCAGT ATATTGGTTA TTAACTACTT GTTGAAGGTT TATCTTCTCC ACTAAACTGT 
TTACGAGTCA TATAACCAAT AATTGATGAA CAACTTCCAA ATAGAAGAGG TGATTTGACA 



541 AAGTTCCACA AGCCTTACAA TATGTGACAG ATATTCATTC ATTGTCTGAA TTCTTCAAAT 
TTCAAGGTGT TCGGAATGTT ATACACTGTC TATAAGTAAG TAACAGACTT AAGAAGTTTA 



601 ACATCCTCTT CACCATAGCG TCTTATTAAT TGAATTATTA ATTGAATAAA TTCTATTGTT 
TGTAGGAGAA GTGGTATCGC AGAATAATTA ACTTAATAAT TAACTTATTT AAGATAACAA 



661 CAAAAATCAC TTTTATATTT AACTGAAATT TGCTTACTTA TAATCACATC TAACCTTCAA 
GTTTTTAGTG AAAATATAAA TTGACTTTAA ACGAATGAAT ATTAGTGTAG ATTGGAAGTT 



721 AGAAAACACA TTAACCAACT GTACTGGGTA ATGTTACTGG GTGATCCCAC GTTTTACAAA 
TCTTTTGTGT AATTGGTTGA CATGACCCAT TACAATGACC CACTAGGGTG CAAAATGTTT 



36/102 



FIGURE 32B 



781 TGAGAAGATA TATTCTGGTA AGTTGAATAC TTAGCACCCA GGGGTAATCA GCTTGGACAG 
ACTCTTCTAT ATAAGACCAT TCAACTTATG AATCGTGGGT CCCCATTAGT CGAACCTGTC 



841 GACCAGGTCC AAAGACTGTT AAGAGTCTTC TGACTCCAAA CTCAGTGCTC CCTCCAGTGC 
CTGGTCCAGG TTTCTGACAA TTCTCAGAAG ACTGAGGTTT GAGTCACGAG GGAGGTCACG 



9 01 CACAAGCAAA CTCCATAAAG GTATCCTGTG CTGAATAGAG ACTGTAGAGT GGTACAAAGT 
GTGTTCGTTT GAGGTATTTC CATAGGACAC GACTTATCTC TGACATCTCA CCATGTTTCA 



961 AAGACAGACA TTATATTAAG TCTTAGCTTT GTGACTTCGA ATGACTTACC TAATCTAGCT 
TTCTGTCTGT AATATAATTC AGAATCGAAA CACTGAAGCT TACTGAATGG ATTAGATCGA 



1021 AAATTTCAGT TTTACCATGT GTAAATCAGG AAGAGTAATA GAACAAACCT TGAAGGGTCC 
TTTAAAGTCA AAATGGTACA CATTTAGTCC TTCTCATTAT CTTGTTTGGA ACTTCCCAGG 



1081 CAATGGTGAT TAAATGAGGT GATGTACATA ACATGCATCA CTCATAATAA GTG CTCTTTA 
GTTACCACTA ATTTACTCCA CTACATGTAT TGTACGTAGT GAGTATTATT CACGAGAAAT 



1141 AATATTAGTC ACTATTATTA GCCATCTCTG ATTAGATTTG ACAATAGGAA CATTAGGAAA 
TTATAATCAG TGATAATAAT CGGTAGAGAC TAATCTAAAC TGTTATCCTT GTAATCCTTT 



1201 GATATAGTAC ATTCAGGATT TTGTTAGAAA GAGATGAAGA AATTCCCTTC CTTCCTGCCC 
CTATATCATG TAAGTCCTAA AACAATCTTT CTCTACTTCT TTAAGGGAAG GAAGGACGGG 



1261 TAGGTCATCT AGGAGTTGTC ATGGTTCATT GTTGACAAAT TAATTTTCCC AAATTTTTCA 
ATCCAGTAGA TCCTCAACAG TACCAAGTAA CAACTGTTTA ATTAAAAGGG TTTAAAAAGT 



1321 CTTTGCTCAG AAAGTCTACA TCGAAGCACC CAAGACTGTA CAATCTAGTC CATCTTTTTC 
GAAACGAGTC TTTCAGATGT AGCTTCGTGG GTTCTGACAT GTTAGATCAG GTAGAAAAAG 



1381 CACTTAACTC ATACTGTGCT CTCCCTTTCT CAAAGCAAAC TGTTTGCTAT TCCTTGAATA 
GTGAATTGAG TATGACACGA GAGGGAAAGA GTTTCGTTTG ACAAACGATA AGGAACTTAT 



14 41 CACTCTGAGT TTTCTGCCTT TGCCTACTCA GCTGGCCCAT GGCCCCTAAT GTTTCTTCTC 
GTGAGACTCA AAAGACGGAA ACGGATGAGT CGACCGGGTA CCGGGGATTA C AAA G AAG AG 



1501 ATCTCCACTG GGTCAAATCC TACCTGTACC TTATGGTTCT GTTAAAAGCA GTGCTTCCAT 
TAGAGGTGAC CCAGTTTAGG ATGGACATGG AATACCAAGA CAATTTTCGT CACGAAGGTA 



1561 AAAGTACTCC TAG C AAATG C ACGGCCTCTC TCACGGATTA TAAGAACACA GTTTATTTTA 
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FIGURE 32C 

TTTCATGAGG ATCGTTTACG TGCCGGAGAG AGTGCCTAAT ATTCTTGTGT CAAATAAAAT 



1621 TAAAGCATGT AGCTATTCTC TCCCTCGAAA TACGATTATT ATTATTAAGA ATTT AT AG C A 
ATTTCGTACA TCGATAAGAG AGGGAGCTTT ATGCTAATAA TAATAATTCT TAAATATCGT 



1681 GGGATATAAT TTTGTATGAT GATTCTTCTG GTTAATCCAA CCAAGATTGA TTTT AT AT CT 
CCCTATATTA AAA CAT ACT A CTAAGAAGAC CAATTAGGTT GGTTCTAACT AAAATATAGA 



17 41 ATTACGTAAG ACAGTAGCCA GACATAGCCG GGATATGAAA ATAAAGTCTC TGCCTTCAAC 
TAATGCATTC TGTCATCGGT CTGTATCGGC CCTATACTTT TATTTCAGAG ACGGAAGTTG 



1801 AAGTTCCAGT ATTCTTTTCT TTCCTCCCCT CCCCTCCCCT CCCTTCCCCT CCCCTTCCTT 
TTCAAGGTCA TAAGAAAAGA AAGGAGGGGA GGGGAGGGGA GGGAAGGGGA GGGGAAGGAA 



1861 CCCTTTCCCT TCCCTTCCTT T CTTTCTTG A GGGAGTCTCA CTCTGTCACC AGGCTCCAGT 
GGGAAAGGGA AGGGAAGGAA AGAAAGAACT CCCTCAGAGT GAGACAGTGG TCCGAGGTCA 



1921 GCAGTGGCGC TATCTTGGCT GACTGCAACC TCCGCCTCCC CGGTTCAAGC GATTCTCCTG 
CGTCACCGCG ATAGAACCGA CTGACGTTGG AGGCGGAGGG GCCAAGTTCG CTAAGAGGAC 



1981 CCTCAGCCTC CTGAGTAGCT GGGACTACAG GAGCCCGCCA CCACGCCCAG CTAATTTTTG 
GGAGTCGGAG GACTCATCGA CCCTGATGTC CTCGGGCGGT GGTGCGGGTC GATTAAAAAC 



2 041 TATTTTTAGT AGAGATGGGG TTTCACCATG TTGGCCAGGA TGGTCTCGAT TTCTCGACTT 
ATAAAAATCA TCTCTACCCC AAAGTGGTAC AACCGGTCCT ACCAGAGCTA AAGAGCTGAA 



2101 CGTGATCCGC CTGTCTGGGC CTCCCAAAGT GCTGGG ATT A CAGGCGTGAG CCACCACGCC 
GCACTAGGCG GACAGACCCG GAGGGTTTCA CGACCCTAAT GTCCGCACTC GGTGGTGCGG 



2161 CGGCTTTAAA AAATGGTTTT GTAATGTAAG TGGAGGATAA TACCCTACAT GTTTATTAAT 
GCCGAAATTT TTTACCAAAA C ATT AC ATT C ACCTCCTATT ATGGGATGTA CAAATAATTA 



2221 AACAATAATA TTCTTTAGGA AAAAGGGCGC GGTGGTGATT TACACTGATG ACAAGCATTC 
TTGTTATTAT AAGAAATCCT TTTTCCCGCG CCACCACTAA ATGTGACTAC TGTTCGTAAG 



2281 CCGACTATGG AAAAAAAGCG CAGCTTTTTC TGCTCTGCTT TTATTCAGTA GAGTATTGTA 
GGCTGATACC TTTTTTTCGC GTCGAAAAAG ACGAGACGAA AATAAGTCAT CTCATAACAT 



2 341 GAGATTGTAT AGAATTTCAG AGTTGAATAA AAGTTCCTCA TAATTATAGG AGTGGAGAGA 
CTCTAACATA TCTTAAAGTC TCAACTTATT TTCAAGGAGT ATTAATATCC TCACCTCTCT 
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FIGURE 50A 

AXGGTJJUJU^TTATCTCITITTTTCTCTCCCCCAXTGTAAXAXOTTATAO 
JU^iUxU^ATCT Url ' l m ICTCTCCCCCAATGTAAAAACTTATAG 

T?T? Mil 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 i 1 1 1 i i 1 1 1 F 1 1 1 1 1 1 ! 1 1 M 1 1 1 11 

TCOtfiroACATOTGTA^ 

A TT T T CTTCTATT CTGTGACATGCCCAC CTTACAC AQAGGACACATTT AC 

1 1 1 1 1 1 1 1 1 li 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 Ml ] 1 1 1 1 1 IN 1 1 1 Mill II 

ATTTTCTTCTATTCTGTGACATGCCCACCTTAC^ 

XAG^TTATATCCCGOGCTTAAATTCGAGCATrGGAATTTGOCCAG'TCTAC 

I IM M I II 1 1 1 1 M 1 1 1 1 1 r 1 1 II Ml 1 1 Ml II I MM Mill II I II 
taggttatatcccgoggttaaattcgag<^ttggaattt^ 

XTCTTT Xi3ACTGA>CAGAACAAATTrTTCTGT«CT^ 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ I M M 1 II M I M I II II I I 

ATGTTOAGAGTGAACACAACAAAOT 

TGGccTACAAGAAG ^^^^^^^^^TF^TT^^rTTT^^T^T^TfT 
1 1 M I II II M II 1 1 II 1 1 II M M II 1 1 Ml MM JLXXXAiJ4^iiM-i4 

TG<^CTACAAGJLAGCATGCACTGOGTTTATTATrA 

GTTTTAAATATTTT CTACAAAAATGTTOACTAAACT AAATl^TA^T ATC A 
ATOCTATAAATAATCAGG<yUU^ 

rTACTGTCAlTOGATTTGTTAATATATTTTTCTCTTOAGT 

II 1 1 II M 1 1 1 1 1 1 Ml II II II M II I M M II II II Mill 1 1 1 II 1 1 

WACTOTCAmGAmGTTAATATATTTTTCTCW 
ATTTTAAAAAATTCCCTITCGACTGTAGAACAAATA^GAA'ITTGGCCTGT 
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FIGURE 55A 

10 20 30 40 

★ ★ * ★* 

CTC AAA AGG GGC CGG ATT TCC TTC TCC TGG AGG CAG ATG TTG CCT CTC 



50 60 70 80 90 

* * * * * * * * 

TCT CTC GCT CGG ATT GGT TCA GTG CAC TCT AGA AAC ACT GCT GTG GTG 



100 110 120 130 140 

★ ★ ★ ★ ★ 

GAG AAA CTG GAC CCC AGG GTG GTT TAT AAA ATC CTC CAA TGA AGC TAC 



150 160 170 180 190 

★ * * * ★ ★ ★ ★ 

TAA CAT TAC TCC AAA GCA TAA TAT GAA AGC ATT TTT GGA TGA ATT GAA 



Met Lys Ala Phe Leu Asp Glu Leu Lys 

200 210 220 230 240 

★ ★ * ★ * * 

AGC TGA GAA CAT CAA GAA GTT CTT ATA TAA TTT TAC ACA GAT ACC ACA 



Ala Glu Asn lie Lys Lys Phe Leu Tyr Asn Phe Thr Gin lie Pro His 
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FIGURE 55B 

250 260 270 280 

* ★ ★ * ★ * * 

TTT AGC AGG AAC AGA ACA AAA CTT TCA GCT TGC AAA GCA AAT TCA ATC 



Leu Ala Gly Thr Glu Gin Asn Phe Gin Leu Ala Lys Gin lie Gin Ser> 

290 300 310 320 330 S 

* * ★ * ★ * * * 

CCA GTG GAA AGA ATT TGG CCT GGA TTC TGT TGA GCT AGC ACA TTA TGA 



Gin Trp Lys Glu Phe Gly Leu Asp Ser Val Glu Leu Ala His Tyr Asp> 

340 350 360 370 380 

* * * * * ★ *■ * ★ 

TGT CCT GTT GTC CTA CCC AAA TAA GAC TCA TCC CAA CTA CAT CTC AAT 



Val. Leu Leu Ser Tyr Pro Asn Lvs Thr His Pro Asn Tyr lie Ser lle> 

390 4 00 410 420 430 

AAT TAA TGA AGA TGG AAA TGA GAT TTT CAA CAC ATC ATT ATT TGA ACC 



lie Asn' Glu Asp Glv Asn Glu lie Phe Asn Thr Ser Leu pne Glu Pro 

440 450 460 470 480 

* * * * * * * * 

ACC TCC TCC AGG ATA TGA AAA TGT TTC GGA TAT TGT ACC ACC TTT CAG 



Pro Pro Pro Gly Tyr Glu Asn Val Ser Asp He Val Pro Pro Phe Ser> 
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FIGURE 55C 

490 500 510 520 

TGC TTT CTC TCC TCA AGG AAT GCC AG A GGG CGA TCT AGT GTA TGT TAA 

Hio phe Ser Pro Gin Gly Met Pro Glu Gly Asp Leu Val Tyr Vai ~*n> 



30 540 550 560 570 

CTA TGC ACG AAC TGA AGA CTT CTT TAA ATT GGA ACG GGA CAT GAA AAT 



Tyr Ala Arg Thr Glu Asp Phe Phe Lys Leu Glu Arg Asp Met Lys Ile> 



580 590 600 610 620 

***** 
CAA TTG CTC TGG GAA AAT TGT AAT TGC CAG ATA TGG GAA AGT TTT CAG 



*sn Cys ber Gly Lys lie Val He Ala Arg Tyr Gly Lys Val Phe Arg> 



630 640 



6 5 0 6 6 0 6.7 0 



AGG AAA TAA GGT TAA AAA TGC CCA GCT GGC AGG GGC CAA AGG AGT CAT 



Gly Asn Lys Val Lys As 



680 690 



Asn Ala Gin Leu Ala Gly *ia Lys Gly Val He: 



700 710 720 



★ 



TCT CTA CTC CGA CCC TGC TGA CTA CTT TGC TCC TGG GGT GAA GTC CTA 
Leu Tyr Ser Asp Pro Ala Asp Tyr Phe Ala Pro Gly Val Lys Ser Tyr> 
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FIGURE 55D 

730 740 750 760 

* ★ * * + * * 

TCC AGA TGG TTG GAA TCT TCC TGG AGG TGG TGT CCA GCG TGG AAA TAT 



Pro Asp Gly Trp Asn Leu Pro Gly Gly Gly Val Gin Arg Gly Asn lie 

770 780 790 800 810 

★ ★ * ★ ★ ★ 

CCT AAA TCT GAA TGG TGC AGG AGA CCC TCT CAC ACC AGG TTA CCC AGC 



Leu Asn Leu Asn Gly Ala Gly Asp Pro Leu Thr Pro Gly Tyr Pro Ala 

820 830 840 850 860 

★ ★ ★ ★ * ★ * * ★ 

AAA TGA ATA TGC TTA TAG GCG ?GG AAT TGC AGA GGC TGT TGG TCT TCC 



Asn Glu Tyr Ala Tyr Arg Arg Gly lie Ala Glu Ala Val Gly Leu Pro 

870 880 890 900 910 

* ★ ★ * * * ★ ★ 

AAG TAT TCC TGT TCA TCC AAT TGG ATA CTA TGA TGC ACA GAA GCT CCT 



Ser lie Pro Val His Pro lie Gly Tyr Tyr Asp Ala Gin Lys Leu . Leu 

920 930 940 950 960 

* * * ★ ★ ★* * *•* 

AGA AAA AAT GGG TGG CTC AGC ACC ACC AGA TAG CAG CTG GAG AGG AAG 



Glu Lys Met Gly Gly Ser Ala Pro Pro Asp Ser Ser Trp Arg Gly Ser 



FIGURE 55E 83/102 

970 980 990 1000 

■* ★ ★ * * 

TCT CAA AGT GCC CTA CAA TGT TGG ACC TGG CTT TAC TGG AAA CTT TTC 



Leu Lys Val Pro Tyr Asn Val Gly Pro Gly Phe Thr Gly Asn Phe Ser 

010 1020 1030 1040 1050 

★ * ★ ★ ★ * * *■ ■* * 

TAC ACA AAA AGT CAA GAT GCA CAT CCA CTC TAC CAA TGA AGT GAC AAG 

Thr Gin Lys Val Lys Met His lie His Ser Thr Asn Glu Val Thr Arg 

1060 1070 1080 1090 1100 

★ ★ ★ * ★ ★ ★ 

AAT TTA CAA TGT GAT AGG TAC TCT CAG AGG AGC AGT GGA ACC AG A CAG 



lie Tyr Asn Val lie Gly Thr Leu Arg Gly Ala Val Glu Pro Asp Arg 

. 1110 1120 1130 1140 1150 

★ ★ ★ ★ * 

ATA TGT CAT TCT GGG AGG TCA CCG GGA CTC ATG GGT GTT TGG TGG TAT 



Tyr Val lie Leu Gly Gly His Arg Asp Ser Trp Val Phe Gly Gly lie 

1160 1170 1180 1190 1200 

* ★ ★ * ★ ★ * * 

TGA CCC TCA GAG TGG AGC AGC TGT TGT TCA TGA AAT TGT GAG GAG CTT 



Asp Pro Gin Ser Gly Ala Ala Val Val His Glu He Val Arg S£r Phe 

1210 1220 1230 1240 

★ ★ * * ★ 

TGG AAC ACT GAA AAA GGA AGG GTG GAG ACC TAG AAG AAC AAT TTT GTT 



Gly Thr Leu Lys Lys Glu Gly Trp Arg Pro Arg Arg Thr He Leu Phe 
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FIGURE 55F 

1250 1260 1270 1280 1290 

* * * * ** * ** * 

TGC AAG CTG GGA TGC AGA AGA ATT TGG TCT TCT TGG TTC TAC TGA GTG 



Ala Ser Trp Asp Ala Glu Glu Phe Gly Leu Leu Gly Ser.Thr Glu Trp 

1300 1310 ' 1320 1330 1340 

* * * * * * * * * 

GGC AGA GGA GAA TTC AAG ACT CCT TCA AGA GCG TGG CGT GGC TTA TAT 



Ala Glu Glu Asn Ser Arg Leu Leu Gin Glu Arg Gly Val Ala Tyr lie 

1350 1360 1370 1380 1390 

******** 
TAA TGC TGA CTC ATC TAT AGA AGG AAA CTA CAC TCT GAG AGT TGA TTG 



Asn Ala Asp Ser Ser lie Glu Gly Asn Tyr Thr Leu Arg Val Asp Cys 

1400 1410 1420 1430 1440 

* * ** ★ ★* '★ ★* 

TAC ACC GCT GAT GTA CAG CTT GGT ACA CAA CCT AAC AAA AGA GCT GAA 



Thr Pro Leu Met Tyr Ser Leu Val His Asn Leu Thr Lys Glu Leu Lys 

1450 1460 1470 1480 

* ** ★ ** * ** 

AAG CCC TGA TGA AGG CTT TGA AGG CAA ATC TCT TTA TGA AAG TTG GAC 



ber fro Asp Glu Gly Phe Glu Gly Lys Ser Leu Tyr Glu Ser Trp Thr 
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FIGURE 55G 

490 1500 1510 1520 1530 

********* * 
TAA AAA AAG TCC TTC CCC AGA GTT CAG TGG CAT GCC CAG GAT AAG CAA 

Lys Lys Ser Pro Ser Pro Glu vn>» Ser Gly Met Pro Arg He ber Lys: 

1540 1550 1560 1570 1580 

******** * 
ATT GGG ATC TGG AAA TGA TTT TGA GGT GTT CTT CCA ACG ACT TGG AAT 

Leu Glv Ser Gly Asn Asp Phe Glu Val Phe Phe Gin Arg Leu Gly He 

1590 1600 1610 1620 1630 

* * * ** * ** * * 

TGC TTC AGG CAG AGC ACG GTA TAC TAA AAA TTG GGA AAC AAA CAA ATT 



Ala Ser Gly Arg Ala Arg Tyr Thr Lys Asn Trp Glu Tnr Asn Lys Phe 

1640 1650 1660 1670 1680 

* * ** * ** *.** 

CAG CGG CTA TCC ACT GTA TCA CAG TGT CTA TGA AAC ATA TGA GTT GGT 

* Ser Gly Tyr Pro i,eu Tyr Hi s Val Tyr Glu Thr Tyr Glu Leu Val 

1690 1700 1710 1720 

* ** * ** *** 

GGA AAA GTT TTA TGA TCC AAT GTT TAA ATA TCA CCT CAC TGT GGC CCA 



Glu Lys Phe Tyr Asp Pro Met Phe Lys Tyr His Leu Thr Val Ala uin 



FIGURE 55H 
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730 1740 1750 1760 1770 

GGT TCG AGG AGG GAT GGT GTT TGA GCT AGC CAA TTC CAT AGT GCT CCC 



Val Arg Gly Gly Met Val Phe Glu Leu Ala Asn Ser lie Val Leu Pro 

1780 1790 1800 1810 1820 

★ ★ ★ * * ★ ★ ★ * * 

TTT TGA TTG TCG AGA TTA TGC TGT AGT TTT AAG AAA GTA TGC TGA CAA 



Phe Asp Cys Arg Asp Tyr Ala Val Val Leu Arg Lys Tyr Ala Asp Lys 

1830 1840 1850 1860 1870 

★ ★ * * ★ * * * 

AAT CTA CAG TAT TTC TAT GAA ACA TCC ACA GGA AAT GAA GAC ATA CAG 



Tie Tyr Ser lie Ser Met Lys His Pro Gin Glu Met Lys Thr Tyr Ser 

1880 1890 1900 1910 1920 

* . * ★ * ★ * 

TGT ATC ATT TGA TTC ACT TTT TTC TGC AGT AAA GAA TTT TAC AGA AAT 



Val Ser Phe Asp Ser Leu Phe Ser Ala Val Lys Asn Phe Thr Glu lie 
1930 1940 1950 1960 

TGC TTC CAA GTT CAG TGA GAG ACT CCA GGA CTT TGA CAA AAG CAA CCC 



Ala Ser Lys Phe Ser Glu Arg Leu Gin Asp Phe Asp Lys Ser Asn Pro 
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FIGURE 551 

1970 1980 1990 2000 2010 

★ - ★ * ★ ★ * * * 

AAT AGT ATT AAG AAT GAT GAA TGA TCA ACT CAT GTT TCT GGA AAG AGC 



lie Val Leu Arg Met Met Asn Asp Gin Leu Met Phe Leu Glu Arg Ala> 



2020 2030 2040 2050 2060 

★ ★ ★ * * ★ * 

ATT TAT TGA TCC ATT AGG GTT ACC AGA CAG GCC TTT TTA TAG GCA TGT 



Phe lie Asp Pro Leu Gly Leu Pro Asp Arg Pro Phe Tyr Arg His Val> 

2070 2080 2090 2100 2110 

★ ★ ★ * * * * * * * 

CAT CTA TGC TCC AAG CAG CCA CAA CAA GTA TGC AGG GGA GTC ATT CCC 

lie Tyr Ala Pro Ser Ser His Asn Lys Tyr Ala Gly Glu Ser Phe Pro> 



2120 2130 2140 2150 2160 

★ ★ * * ★ *• * * 

AGG AAT TTA TGA TGC TCT GTT TGA TAT TGA AAG CAA AGT GGA CCC TTC 



Gly lie Tyr Asp Ala Leu Phe Asp lie Glu Ser Lys Val Asp Pro Ser> 



2170 2180 2190 2200 

★ ★ + * * 

CAA GGC CTG GGG AGA AGT GAA GAG ACA GAT TTA TGT TGC AGC CTT CAC 



Lys Ala Trp Gly Glu Val Lys Arg Gin lie Tyr Val Ala Aia Phe Thr> 



2210 2220 2230 2240 2250 

★ ★ * * * * * * 

AGT GCA GGC AGC TGC AGA GAC TTT GAG TGA AGT AGC CTA AGA GGA TTC 



Val Gin Ala Ala Ala Glu Thr Leu Ser Glu Val Ala 
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FIGURE 58 J 



2260 2270 2280 2290 2300 

********* 
TTT AGA GAA TCC GTA TTG AAT TTG TGT GGT ATG TCA CTC AGA AAG AAT 



2310 2320 2330 2340 2350 

* 

CGT AAT GGG TAT ATT GAT AAA TTT TAA AAT TGG TAT ATT TGA AAT AAA 



2360 2370 2380 

******* 
GTT GAA TAT TAT ATA TAA AAA AAA AAA AAA AAA AA 
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